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DETAILED ACTION 
A llowable Subject Matter 

1 , Claims 1-32 would be allowable over the prior art of record if rewritten to overcome any 
objections or rejections under 35 U.S.C. 1 12(2), especially as appearing in this Office action. 
Certain assumptions that make the limitations clear have been considered for the claims, as 
described next or elsewhere in this Office action. 

a. The allowable subject matter of independent claim 1 resides in the whole structure 
and interaction expressed by the combination of all limitations compared to the prior art of record 
that includes the common occurrence matrix having is rows and columns generated as claimed 
based on the co-occurrence matrix. No particular reference provides relevant, objective evidence 
to make the whole structure and interaction of the claimed method obvious by changing the closest 
prior art of record (Schuetze, Foltz, Manning et al.) to include the generation as claimed of rows 
and columns of the common matrix as products based on co-occurrence matrices for documents, 
particularly with document segments (from the documents) to generate the co-occurrence 
matrices. 

b. Independent claim 2, at least sets forth allowable material similar to claim 1 . 

c. Independent claim 3, at least sets forth allowable material similar to claim 1 . 

d. Independent claim 4, at least sets forth allowable material similar to claim 1 . 

Specification 

2. The title is objected to because it is not sufficiently descriptive of the invention. A new 
title is required that is clearly indicative of the invention to which, the claims are directed. See 
MPEP § 606.01 . The Examiner suggests that the Applicant consider a title including these 
elements: "Segment Vector, Co-Occurrence Matrix, and Common Co-Occurrence Matrix for 
Evaluating Commonality of Documents." 



Appuca tion/Control Number: 1 0/694, 773 
Division: 2626 



Pages 



Claim Informalities 

3. Claim 1 , and by dependency claims 5, 9, 1 3, 1 7, 21 , 25, and 29, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the document segment vectors for each of the 
documents in the document set" (in (b)) needs clarification. Because it was previously recited that 
the documents segment vectors were for each of the documents segments, not fir each of the 
documents, it may be unclear as to what element this phrase refers. One of ordinary skill in the art 
may be unsure of whether the recitation of document segment vectors for each of the documents 
further limits the previously recited vectors to somehow correspond to each document, in addition 
to corresponding to terms in the document segment. To further timely prosecution and evaluate 
prior art, the Examiner has interpreted this whole limitation (b) in parallel to limitation (a) as 
"generating for each of the documents of the document set, a co-occurrence matrix firom the 
document segment vectors--. 

4. Claim 1, and by dependency claims 5, 9, 13, 17, 21, 25, and 29, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "such co-occurrence matrices of the respective 
documents" (in (c)) needs clarification. The phrase "such co-occurrence matrices" is not defined 
by the claim, the specification does not provide a standard for ascertaining the requisite degree by 
which co-occurrence matrices are such matrices as the plurality comprised of a co-occurrence 
matrix for each of the documents of the document set, which were recited earlier in the claim, and 
one of ordinary skill in the art may not be reasonably apprised of the scope of the invention 
encompassed by "such co-occurrence matrices". The phrase "the respective documents" is not 
defined by the claim, the specification does not provide a standard for ascertaining the requisite 
degree by which documents are respective. The plain meaning of "respective" requires some 
assumptions about what plural elements are to be considered individually. One of ordinary skill in 
the art would not be reasonably apprised of how to assume the scope of the invention as 
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encompassed by "respective documents". To further timely prosecution and evaluate prior art, the 
Examiner has interpreted this phase as -the co-occurrence matrices*--. 

5. Claim 2, and by dependency claims 6, 10, 14, 18, 22, 26, and 30, are objected to for the 
same two reasons as claim 1 because the limitations are recited using obviously similar phrases. 

6. Claim 2, and by dependency claims 6, 10, 14, 18, 22, 26, and 30, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the co-occurrence matrices of the documents" 
(in (d)) needs clarification. Are these the matrices that were recited earlier as generated for (each 
of) the documents? To further timely prosecution and evaluate prior art, the Examiner has 
interpreted this phase as —the co-occurrence matrices for the documents-. 

7. Claim 2, and by dependency claims 6, 10, 14, 1 8, 22, 26, and 30, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "or the document segments" (in (d)) needs 
clarification. It is not clear why this phrase appears in the claim because it is not clear what 
purpose the phrase serves at its location in the either-or relationship. Because of the either-or 
relationship, the Examiner has interpreted this phase as -or between the document segments-. 

8. Claim 3, and by dependency claims 7, 1 1, 15, 19; 23, 27, and 31, are objected to for the 
same two reasons as claim 1 because the limitations are recited using obviously similar phrases. 

9. Claim 3, and by dependency claims 7, 11, 15, 19, 23, 27, and 3 1, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the co-occurrence matrices of the respective 
documents" (in (d)) needs clarification. The phrase "the respective documents" is not defined by 
the claim, the specification does not provide a standard for ascertaining the requisite degree by 
which documents are respective. The plain meaning of "respective" requires some assumptions 
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about what plural elements are to be considered individually. One of ordinary skill in the art may 
not be reasonably apprised of how to assume the scope of the invention as encompassed by 
"respective documents". To further timely prosecution and evaluate prior art, the Examiner has 
interpreted this phase as —the co-occurrence matrices--. 

1 0. Claim 3, and by dependency claims 7, 11 , 1 5, 1 9, 23, 27, and 3 1 , are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the corresponding component" (in (e)) needs 
clarification. The phrase "the corresponding component" is not defined by the claim for 
components of the common co-occurrence matrix, and one of ordinary skill in the art may not be 
reasonably apprised of the scope of the invention encompassed by a "corresponding component". 
To further timely prosecution and evaluate prior art, the Examiner has interpreted this whole 
limitation as -correcting to become "0" any component of the common co-occurrence matrix of 
mismatch allowance type that corresponds to a component of the co-occurrence count matrix that 
has a value less than a predetermined threshold-. Is this close to what the Applicant intended for 
this subject matter? 

1 1 . Claim 3, and by dependency claims 7, 1 1 , 1 5, 1 9, 23, 27, and 3 1 , are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the document set commonality of mismatch 
allowance type" (in (e)) needs clarification. Because the document set commonality was not 
previously set forth as being of any particular type, especially since it was not set forth s being of 
mismatch allowance type, it may be unclear as to what element this phrase refers. To further 
timely prosecution and evaluate prior art, the Examiner has interpreted this phase as —the 
document set commonality-. 
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12. Claim 4, and by dependency claims 8, 12, 16, 20, 24, 28, and 32, are objected to for the 
same two reasons as claim 1 because the limitations are recited using obviously similar phrases. 
In addition 

In addition, claim 4 is object to because the meaning of the phrase "such document 
segment vectors" (in (b)) needs clarification. The phrase "such document segment vectors" is not 
defined by the claim, the specification does not provide a standard for ascertaining the requisite 
degree by which document vectors are such vectors as those vectors in which components 
corresponding to terms appearing in the documents segment assume a value "1", and the other 
components assume a value "0", which were recited earlier in the claim, and one of ordinary skill 
in the art may not be reasonably apprised of the scope of the invention encompassed by "such 
document segment vectors". 

13. Claim 4, and by dependency claims 8, 12, 16, 20, 24, 28, and 32, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the co-occurrence matrices of the respective 
documents" (in (d)) needs clarification. The phrase "the respective documents" is not defined by 
the claim, the specification does not provide a standard for ascertaining the requisite degree by 
which documents are respective. The plain meaning of "respective" requires some assumptions 
about what plural elenients are to be considered individually. One of ordinary skill in the art may 
not be reasonably apprised of how to assume the scope of the invention as encompassed by 
"respective documents". To further timely prosecution and evaluate prior art, the Examiner has 
interpreted this phase as -the co-occurrence matrices—. 

14. Claim 4, and by dependency claims 8, 12, 16, 20, 24, 28, and 32, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the corresponding component" (in (e)) needs 
clarification. The phrase "the corresponding component" is not defined by the claim for 
components of the common co-occurrence matrix, and one of ordinary skill in the art may not be 
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reasonably apprised of the scope of the invention encompassed by a "corresponding component". 
To further timely prosecution and evaluate prior art, the Examiner has interpreted this whole 
limitation as -correcting to become "0" any component of the common co-occurrence matrix of 
mismatch allowance type that corresponds to a component of the co-occurrence count matrix that 
has a value less than a predetermined threshold--. Is this close to what the Applicant intended for 
this subject matter? 

1 5. Claim 4, and by dependency claims 8, 12, 16, 20, 24, 28, and 32, are objected to under 37 
CFR 1 .75(a) because the meaning of the phrase "the co-occurrence matrix of the document" 

(in (f), two occurrences) needs clarification. Is this the matrix that was recited earlier as generated 
for (each of) the documents? To further timely prosecution and evaluate prior art, the Examiner 
has interpreted this phase as -the co-occurrence matrix for the document-. 

1 6. Claim 5 is objected to under 37 CFR 1 .75(a) because the meaning of the phrase "the 
occurring terms" (second line) needs clarification. Because no occurring terms were previously 
recited, it may be unclear as to what element this phrase refers. To further timely prosecution and. 
evaluate prior art, the Examiner has interpreted this phase as —the appearing terms—. 

1 7. Claim 6 is objected to for the same reasons as claim 5 because the limitations are recited 
using obviously similar phrases. 

18. Claim 7 is objected to for the same reasons as claim 5 because the limitations are recited 
using obviously similar phrases. 

19. Claim 8 is objected to for the same reasons as claim 5 because the limitations are recited 
using obviously similar phrases. 
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20. Claim 9 is objected to under 37 CFR 1 .75(a) because the scope must be interpreted when 
the symbols making up the claim limitations are not defined in the claim. The symbols "r", 
and "5^'' should be defined in the claims at least the first time used, if a concise and accurate 
definition is available. To further timely prosecution and evaluate prior art, the Examiner has used 
the definitions of these terms that appear in claim 5. The Examiner suggests that having this claim 
dependent to claim 5 would provide appropriate definitions for this claim. 

21 . Claim 10 is objected to for the same reasons as claim 9 because the limitations are recited 
using obviously similar phrases. The Examiner suggests that having this claim dependent to 
claim 6 would provide appropriate definitions for this claim. 

22. Claim 1 1 is objected to for the same reasons as claim 9 because the limitations are recited 
using obviously similar phrases. The Examiner suggests that having this claim dependent to 
claim 7 would provide appropriate definitions for this claim. 

23. Claim 12 is objected to for the same reasons as claim 9 because the limitations are recited 
using obviously similar phrases. The Examiner suggests that having this claim dependent to 
claim 8 would provide appropriate definitions for this claim. 

24. Claim 1 3 is objected to under 37 CFR 1 , 75(a) because the meaning of the phrase "the 
corresponding term in the respective document" (third and fourth lines) needs clarification. 
Because no one, particular, corresponding term was previously recited, it may be unclear as to 
which one of the corresponding terms this phrase refers, since (plural) components and terms are 
recited to correspond in claim 1. According to the Examiner's assumption for claim 1, the unclear 



Application/Control Number: 10/694 J7 3 
Division: 2626 



Page 9 



phrase "the respective documents" was not used during the current review and evaluation of prior 
art. Therefore, to further timely prosecution and evaluate prior art, the Examiner has interpreted 
this phase as -each corresponding term--. 

25. Claim 14 is objected to for the same reasons as claim 13 because the limitations are recited 
using obviously similar phrases. 

26. Claim 15 is objected to for the same reasons as claim 13 because the limitations are recited 
using obviously similar phrases. 

27. Claim 16 is objected to for the same reasons as claim 13 because the limitations are recited 
using obviously similar phrases. 

28. Claim 21 is objected to for the same reasons as claim 5 because the limitations are recited 
using obviously similar phrases. 

29. Claim 22 is objected to for the same reasons as claim 5 because the limitations are recited 
using obviously similar phrases. 

30. Claim 23 is objected to for the same reasons as claim 5 because the limitations are recited 
using obviously . similar phrases. 

3 1 . Claim 24 is objected to for the same reasons as claim 5 because the limitations are recited 
using obviously similar phrases. 
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32. Claim 25 is objected to for the same reasons as claim 9 because the limitations are recited 
using obviously similar phrases. The Examiner suggests that having this claim dependent to 
claim 21 would provide appropriate definitions for this claim. 

33. Claim 26 is objected to for the same reasons as claim 9 because the limitations are recited 
using obviously similar phrases. The Examiner suggests that having this. claim dependent to 
claim 22 would provide appropriate definitions for this claim. 

34. Claim 27 is objected to for the same reasons as claim 9 because the limitations are recited 
using obviously similar phrases. The Examiner suggests that having this claim dependent to 
claim 23 would provide appropriate definitions for this claim. 

35. Claim 28 is objected to for the same reasons as claim 9 because the limitations are recited 
using obviously similar phrases. The Examiner suggests that having this claim dependent to 
claim 24 would provide appropriate definitions for this claim. 

36. The Examiner notes, without objection, that the form of the claims does not comply with 
37 CFR 1 .75(g), which requires that dependent claims be grouped together with the claim or 
claims to which they refer to the extent practicable. Correction is NOT required. New claims 
added in compliance with 37 CFR 1.121 that are deemed as not complying with 37 CFR 1.75(g) 
will not be considered a bona fide attempt to provide complete replies to Office communications 
concerning this application. 
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Conclusion 

37. The following references here made of record are considered pertinent to applicant's 
disclosure: 

Schuetze [US Patent 5,675,819] determines lexical co-occurrence of terms by constructing a 

vector representation for each term in whole documents and matrix terms as sums from the 
vectors, 

Tukey et al. [US Patent 5,787,422] determines word overlap for documents from a vector 

representation of sets of words appearing in the documents. 
Foltz et al. [US Patent 6,356,864] stores text passages from documents into a vector representation 

and then into a two-dimensional matrix of frequency of occurrence of terms. 

38. Any response to this action may be mailed to: 

Mail Stop Amendment 

Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 

or faxed to: 

(571) 273-8300, (for both formal communications intended for entry and for 
informal or draft communications, but please label informal fax as "INFORMAL" 
or "DRAFT") 

Some patent correspondence and/or fees may be submitted using the Office's electronic 
filing system (EFS). See the Office's Internet Web site for additional information, for 
example http:// www. USPTO. gov/ ebc/ ebc_faqs. htm. 

Some patent correspondence may be delivered by hand or delivery services, other than the 
USPS, addressed as follows and brought to U.S. Patent and Trademark Office, Customer 
Service Window, Mail Stop Amendment, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 
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39. Any inquiry concerning this communication or earlier communications from the examiner 
should be directed to Donald L. Storm, of Division 2626, whose telephone number is 
(571) 272-7614. The examiner can normally be reached on weekdays between 7:00 AM and 3:30 
PM Eastern Time. If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Richemond Dorvil can be reached on (571) 272-7602. ^ 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Inquiries regarding the status of submissions 
relating to an application or questions on the Private PAIR system should be directed to the 
Electronic Business Center (EBC) at 866-217-9197 (toll-free) or 571-272-4100 between the hours 
of 6 a.m. and midnight Monday through Friday EST, or by e-mail at: ebc@uspto.gov. For general 
information about the PAIR system, see http://pair-direct.uspto.gov. If you would like assistance 
from a USPTO Customer Service Representative or access to the automated information system, 
call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Donald L. Storm/ 

October 12,2007 

Primary Patent Examiner 
Division 2626 



